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Abstract. A search for the rare decays B" ^ and fi" ^ fi*fj is performed with the LHCb experiment using 
1.1 fb"' of data collected at V* = 8TeV and 1.0 fb"' of data collected at = 7TeV. An excess of B, — > u*li 
candidates with respect to the background expectations is observed with a statistical significance of 3.5 standard 
deviations. A branching fraction of 3(5° —> fj.*fi~) = (3.2*[ 2) x 10~' is measured with an unbinned maximum 
likeUhood fit. The measured branching fraction is in agreement with the expectation from the Standard Model. 
The observed number of B" -> ft*i^~ candidates is consistent with the background expectation and an upper 
limit on the branching fraction of S(_Bf -> fi*n~) < 9.4 x 10"'° is obtained. 



One of the most important goals of the LHCb experiment 
at the LHC is to search for phenomena that cannot be ex- 
plained by the Standard Model (SM) of particle physics. 
Precise measurements of the branching fractions of the 
two Flavour Changing Neutral Current (FCNC) decays 
fi'' — > yU"^jU~ and 5" — > belong to the most promising 
of these searches. Both decays are strongly suppressed by 
loop and helicity factors, making the SM branching frac- 
tion small [1] 

S(B?^/iV) = (3.23 ± 0.27) X 10"^ and (1) 

S(5"^;uV) = (0.11 ± 0.01) X 10"'* . (2) 

These theoretical predictions are the CF-averaged branch- 
ing fractions. As pointed out in Ref. [2], the finite width 
difference of the fi'' system needs to be considered. The 
time integrated branching fraction is evaluated to be 



S(B? ^/^VW» = 3.4x10- 



(3) 



for which the SM prediction (Eq. 1) and the LHCb mea- 
surement of the width difference AFj [3] are used. This is 
the expected value which is to be compared with an exper- 
imental measurement. 

Enhancements of the branching fractions of these de- 
cays are predicted in a variety of different extensions of 
the Standard Model, an overview is given in Ref. [4]. In 
one popular example, the Minimal Supersymmetric Stan- 
dard Model (MSSM), the enhancement is proportional to 
tan^yS, where tan/? is the ratio of the vacuum expectation 
values of the two Higgs fields. For large values of tan/?, 
this search belongs to the most sensitive probes for physics 
beyond the SM which can be performed at collider experi- 
ments. A review of the experimental status of the searches 
for Bf^^j fi^fT can be found in [5]. 
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2 Dataset and analysis strategy 

The measurements [6] presented here use data recorded by 
the LHCb experiment: 1 fb"', recorded in 2011 at an cen- 
ter of mass energy of ^/s = 7TeV, combined with 1.1 fb"^ 
of data recorded in 2012 at ^fs = 8TeV. The first part 
of the dataset has already been analyzed [7] and was used 
to produce the lowest published limit on the decay rate of 
both decays 5^ — > fi'^fJ.~ and Z?"— > /j^/U". 

Two main improvements have been implemented over 
the previous analysis: the use of particle identification to 
select B°^j — > h^h'~ decays which are used to calibrate 
the geometrical and kinematic variables, and a refined esti- 
mate of the exclusive backgrounds. The updated estimate 
of the exclusive backgrounds is also applied to the 2011 
data and the results reevaluated. The results obtained with 
the combined 2011 and 2012 data sets supersede those of 
Ref. [7]. 

Candidate — > ii'^i^' events are required to be se- 
lected by a hardware and a subsequent software trigger [8], 
dominantly by single and dimuon lines. The first step of 
the final analysis is a simple selection, which removes the 
dominant part of the background and keeps about 60% of 
the reconstructed signal events. A second selection step, 
based on a Boosted Decision Tree (BDT) reduces 80% of 
the remaining background while retaining 92% of the sig- 
nal. More details on the selections are given in Ref. [6] 

2.1 Signal discrimination 

Each event is then given a probability to be signal or back- 
ground in a two-dimensional probability space defined by 
the dimuon invariant mass and a multivariate discriminant 
operator This likelihood combines kinematic and topo- 
logical variables of the B^^^ decay using a BDT. The BDT 
is defined and trained on simulated events for both signal 



EPJ Web of Conferences 




Figure 1. BDT distribution for the 2012 dataset, for the sig- 
nal (black squares) and combinatorial background (blue open 
points). Values normalized to the bin size. 



and background. The signal BDT shape is then calibrated 
using decays of the type Sj'^j — > h^h'~, where h- represents 
a K- or JT-. These decays have an identical topology to the 
signal. The calibrated BDT signal and background shape 
is shown in Fig. 1 . It is designed to be flat in the signal, 
whereas the shape in the background falls over four orders 
of magnitude. 

The invariant mass line shape of the signal events is 
described by a Crystal Ball function [9]. The mass resolu- 
tion is calibrated with a combination of two methods: an 
interpolation of i/^(25) and T(15), T(25') and T(35) 
decays to two muons and from exclusive S° ^ — > h'^h'~ 
samples. The results are o-go = 25.0 ± 0.4MeV/c^ and 
(Tgo = 24.6 ± 0.4MeV/c-. The transition point of the ra- 
diative tail is obtained from simulated S*' — > events 
smeared to reproduce the mass resolution measured in the 
data. 

The background shapes are calibrated simultaneously 
in the mass and the BDT using the invariant mass side- 
bands. This procedure ensures that even though the BDT 
is defined using simulated events, the result will not be bi- 
ased by discrepancies between data and simulation. 

2.2 Binning 

The binning of the BDT and invariant mass distributions 
is optimized using simulation, to maximize the separa- 
tion between the median of the test statistic distribution 
expected for background and SM i?j — > /^^yu" signal, and 
that expected for background only. The chosen number 
and size of the bins are a compromise between maximiz- 
ing the number of bins and the necessity to have enough 
■^M ~^ events to calibrate the S'^ ^'^fi~ BDT and 

enough background in the mass sidebands. 



3 Normalization 

The number of expected signal events is evaluated by 
normalizing with channels of known branching fraction. 



Two independent channels are used: B~ — > J/i//K~ and 
fi° — > K'^7T~ . The first decay has similar trigger and muon 
identification efficiency to the signal but a diff^erent num- 
ber of particles in the final state, while the second chan- 
nel has the same two-body topology but is selected with 
a hadronic trigger. The event selection for these channels 
is specifically designed to be as close as possible to the 
signal selection. The normalization for 5° — > and 



is then given as 



tsig fd(s) 



^ norm 



(4) 



where /^(s) and /,omi are the probabilities that a b quark 
fragments into a fij^^^ and into the hadron involved in the 
given normalization mode respectively. The recently up- 
dated value fjfd = 0.256 ± 0.020 [10] is used. S„orm indi- 
cates the branching fraction and A'norm the number of sig- 
nal events in the normalization channel obtained from a fit 
to the invariant mass distribution. The efficiency esig(nonn) 
for the signal (normalization channel) is the product of the 
reconstruction efficiency of all the final state particles of 
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Figure 2. Invariant mass distribution of B~ —> J/ ifrK~ (top) and 
B° K*n~ (bottom) candidates. The fit to data is superim- 
posed in blue while the blue dotted line is the combinatorial back- 
ground. In the fit to the B" K*7i- data, the 6" K^n' com- 
ponent (red line), the fi" K*7t~ (green dashed line), and the 
partially reconstructed background (black dotted line) are also 
shown. 
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the decay including the geometric acceptance of the detec- 
tor, the selection efficiency for reconstructed events, and 
the trigger efficiency for reconstructed and selected events. 
Ngi ^^f_i*fi- is the number of observed signal events. The 
ratios of reconstruction and selection efficiencies are esti- 
mated from the simulation, while the ratios of trigger effi- 
ciencies on selected events are determined from data. 

The fit to the two normalization channels is shown in 
Fig. 2. The observed numbers of B~ — > J/iJ/K~ and 5" — > 
K+7T- candidates are 424 222 ± 1 452 and 14 579 ± 1110. 
The two normalization factors are in agreement within the 
uncertainties and their weighted average, taking con-ela- 
tions into account, is 

ff™™ = (2.80 ± 0.25) X 10-'° and 

Q-flo'™ = (7.16 ± 0.34) X 10"", (5) 

for S*' — > yU^yu" and 5" — > candidates inside a sig- 
nal window of ±60 MeV/c^ around the mass central value. 
These normalization factors are used for the limit compu- 
tation. The normalization factors in the full mass range, 
used in the fit for the branching fraction, are 10% lower. 

4 Background characterization 

Partially reconstructed decays of beauty mesons or 
baryons can pollute the low mass sidebands. The domi- 
nant modes are: 

• B° ^ tt'V""/^" and 5+ 7r+yu+yu" 

• ^ h+h'- (with/j<'' = K,7t) and 

• Afc -> pi^'v . 

In some of these modes kaons, pions and protons are 
misidentified as muons. The contributions of these de- 
cays to the S°^^ — > yu^/i" analysis is estimated from Monte 
Carlo simulated samples by folding the — > .tt — > 
and /7 — > /i fake rates extracted from — > K^7t~ and 
A — > p7T~ data samples into the spectrum of simulated 
events. The fractional yields in BDT bins and the parame- 
ters that describe the mass lineshape are used as nuisance 
parameters in the unbinned maximum likelihood fit used 
to determine the branching fraction. 

The exclusive background contribution in the most 
sensitive region of the analysis (BDT> 0.8) is dominated 
by the S" ^ 7z-fi+v, 5^^^ ^ h+h'- and fi"-+ ^ 7r»-+yU+/|- 
decays. 

5 Results 

The observed pattern of events in the 15 BDT bins (8 in 
the 201 1 data and 7 in the 2012 data) is shown in Fig. 3 for 
— > (top) and S" — > (bottom) together with 
the fit for the branching fraction, which includes compo- 
nents for the exclusive background components discussed 
in Sec. 4. 

The number of expected combinatorial background 
events in the 5*' and 5° search windows is determined 
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Figure 4. CLs as a function of the assumed S(f) or B" 
tJ.*fJ.~ (top) and Bj —> ti*fJ.~ (bottom) decays for the combined 
2011+2012 dataset. The long dashed gray curves are the medi- 
ans of the expected CLs distributions if background and SM sig- 
nal were observed. The yellow area covers the Icr area around 
the median. The solid red curves are the observed CLs. For the 
P*IJ~ (bottom), the long dashed gray curve in the green 
area is the expected CLs distribution if background only was ob- 
served with its icr interval. 



from a simultaneous unbinned likelihood fit to the mass 
projections in the BDT bins. The same fit is then per- 
formed on the full mass range to extract the 5*' — > ji^iT 
and if — > n'^'tr branching fractions. 

In this fit the parameters that describe the mass distri- 
butions of the exclusive backgrounds, their fractional yield 
in each BDT bin and their overall yields are constrained to 
vary within ±\cr with respect to the expected values. The 
combinatorial background is parameterized with an expo- 
nential function with a slope and a normalization which 
are free parameters of the fit. 

The S° — > yU"^yU" and if — > n^^T signal yields are free 
parameters of the fit. Their fractional yields in BDT bins 
are constrained to the BDT fractions calibrated with the 
— > h'*'h'~ sample and the parameters of the Crystal 
Ball functions that describe the mass lineshape are con- 
strained to vary within ±lcr with respect to the expected 
values. 

The systematic uncertainties in the exclusive back- 
ground and signal predictions in each bin are computed 
by fluctuating the mass parameters, the BDT fractional 
yields and the normalization factors along the Gaussian 
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Figure 3. Simultaneous fit of the invariant mass distribution in the 8 BDT bins of 201 1 (top) and 7 BDT bins of 2012 data (bottom). 
The fit result is superimposed in blue, the individual components are given in the legend. 



distributions defined by their associated uncertainties. The 
systematic uncertainty on the estimated number of com- 
binatorial background events in the search windows is 
computed by fluctuating with a Poissonian distribution the 
number of events measured in the sidebands, and by vary- 
ing the value of the exponent accordingly to the its uncer- 
tainty. 

The compatibility of the observed distribution of 
events with a given branching fraction hypothesis is com- 
puted using the CLs method [11, 12]. The pattern observed 
for — > decays is compatible with the background 
only hypothesis. The CLs curve is shown in Fig. 4 (top). 
The observed CLb value at CL,+i, = 0.5 is 89%. 



An excess of S° — > yu"^//" candidates is observed, 
the CLs curve to evaluate its significance is shown in 
Fig. 4 (bottom). The probability that background pro- 
cesses can produce the observed number of — > fi^/J-' 
candidates or more is 5 x 10"'* and coiresponds to a statisti- 
cal significance of about 3.5 standard deviations. The val- 
ues of the — > //"^yu" branching fraction extracted from 
the fit is S(B° fi^fj-) = (3.2+[|) x 10-^ in good agree- 
ment with the SM prediction. 
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6 Conclusions 



A search for the rare decays 5j — > ii^ji and B" — > iX^n 
has been performed with 1.1 fb"^ of data collected at 
= 8 TeV and 1 .0 fb" ^ of data collected at = 7 TeV. The 
data in the BP search window are consistent with the back- 
ground expectations and an upper limit of !B{lf — > n^iT) 
< 9.4 X lO"!" is obtained at 95% CL. This is the most 
stringent published limit on this decay rate. The data in 
the 5j search window show an excess of events with re- 
spect to the background expectation with a statistical sig- 
nificance of 3.5 cr. A branching fraction of — > n^n') 
= (3.2;|;j 2) x 10"^ is measured. This is the first evidence of 
the ^ jX^lT decay. 

The next step is a precision measurement of the decay 
rate of 5° — > iX^jT and then to limit and then measure the 
ratio of the decay rates of — > iX^jTl^ — » jjtjJT ■ This 
ratio allows a stringent test of the hypothesis of minimal 
flavor violation and a good discrimination between various 
extensions of the Standard Model. 

It should be stated that the precise measurement of 
— > provides complementary information to 

the searches performed at high px experiments. 
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1 capri proceedings 

Precise measurements of the branching fractions of the 
two FCNC decays — > fi^fi' and — > ju'^fi' belong to 
the most promising measurements for a possible discov- 
ery of a theory beyond the SM. These decays are strongly 
suppressed by loop and helicity factors, making the SM 
branching fraction small [? ]: S(B? ji^tr) = (3.23 ± 
0.27) X 10-9 and S{B^ -> /xV) = (O.H ± 0.01) x lO'^. 
Taking the finite width difference of the system into ac- 
count, the time integrated branching fraction is evaluated 
to be [? ] 

S(B?^/^V)sM,» = 3.4xlO-^ (1) 

Enhancements of the decay rates of these decays are pre- 
dicted in a variety of different New Physics models, a sum- 
mary is given in Ref. [? ]. For example, in the minimal su- 
persymmetric Standard Model (MSSM), the enhancement 
is proportional to tan*/?, where tan/J is the ratio of the vac- 
uum expectation values of the two Higgs fields. For large 
values of tany6, this search belongs to the most sensitive 
probes for physics beyond the SM which can be performed 
at collider experiments. A review of the experimental sta- 
tus of the searches for — > jX^jjT can be found in [? 
]. 

The measurements presented here use 1 fb"' of data 
recorded by the LHCb experiment in 2011. Assuming the 
SM branching ratio, about 12 (1.3) B^ (B") decays are ex- 
pected to be triggered, reconstructed and selected in the 
analyzed dataset. 

The first step of the analysis is a simple selection, 
which removes the dominant part of the background and 
keeps about 60% of the reconstructed signal events. As 
second step, a preselection, based on a Boosted Decision 
Tree (BDT) reduces 80% of the remaining background 
while retaining 92% of the signal. 

Each event is then given a probability to be signal or 
background in a two-dimensional probability space de- 
fined by the dimuon invariant mass and a multivariate dis- 
criminant likelihood. This likelihood combines kinematic 
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and topological variables of the B|^j decay using a BDT. 
The BDT is defined and trained on simulated events for 
both signal and background. The signal BDT shape is then 
calibrated using decays of the type Bj'^^ — > h'^h'~, where 
h- represents a K- or n-. These decays have an identical 
topology to the signal. The invariant mass resolution is 
calibrated with an interpolation of J/i^, (^(25) and T(15'), 
T(25') and T(35') decays to two muons. The background 
shapes are calibrated simultaneously in the mass and the 
BDT using the invariant mass sidebands. This procedure 
ensures that even though the BDT is defined using simu- 
lated events, the result will not be biased by discrepancies 
between data and simulation. 

The number of expected signal events is evaluated by 
normalizing with channels of known branching fraction. 
Three independent channels are used: — > J/i//K^, 
Bj — > and B" — > K'^7t~. The first two decays have 

similar trigger and muon identification efficiency to the 
signal but a different number of particles in the final state, 
while the third channel has the same two-body topology 
but is selected with a hadronic trigger. The event selection 
for these channels is specifically designed to be as close 
as possible to the signal selection. The ratios of recon- 
struction and selection efficiencies are estimated from the 
simulation, while the ratios of trigger efficiencies on se- 
lected events are determined from data The observed pat- 
tern of events in the high BDT range is shown in Fig. 1 for 
Bj — > fi^fi~ (top) and B" — > fi'^fi~ (bottom). A moderate 
excess over the background expectations is seen in the B*j 
channel. This excess is consistent with the SM prediction. 
No excess is seem in the B" channel. 

The compatibility of the observed distribution of 
events with a given branching fraction hypothesis is com- 
puted using the CLs method [? ? ]. The measured up- 
per limit for the branching ratio is at 95% confidence level 
(CL) 

S(B°^^V)L//ci < 4.5x10-' and (2) 
S(B"^y^V)L//cA < 1.0xlO-^ (3) 

which are the worlds best upper exclusion limit on the 
branching fraction of this decay. A combination [? ] of 
this measurement with the ATLAS and CMS upper exclu- 
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sion limits yields at 95% CL 
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S{B°^ ti^li-)LHC < 4.2x10-' and (4) 
S(B°^ IJ^IJ-)lhc < 0.8xlO-^ (5) 
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which is only a factor 20% above the SM prediction given 
in Eq. 1. This puts tight constraints on various exten- 
sions of the Standard Model, especially on supersymmet- 
ric models at high values of tan /J. 



The CMS and LHCb collaborations have excellent 
prospects to observe the decay 5*' — > /j.'^'/j.' with the dataset 
collected in 2012. This observation, and the precision 
measurement of S(S^ — > ji^ji') in the coming years will 
allow to fJfti|tiipi}§eS8li^ft|j«g^}ji}a^lag of any 

extension of the Standard Model. The next step will be 
to limit and tlf^tmeasuisfeibfid'atiotMfdhe decay rates of 
yu"'yU-/Zji'umbe^<"'y"nuiffltegh iA\jmScf^ stringent test 
of the hypoth{}ttSriffefninjjieJbSpvoinSj@bajon and a good 
discrimination between various extensions of the Standard 
Model. 



In the absence of an observation, limits on 
fi'^fi~) are complementary to those pro- 



Figure 1. Distribution of selected signal candidates. Events 
observed in LHCb in the B° channel (top) and the B" channel 
(bottom) for BDT> 0.5 and expectation for, from top, SM sig- 
nal (grey), combinatorial background (light grey), B°^,^ —> h*h'~ 
background (black) and cross-feed between both modes (dark 
grey). The hatched area depicts the uncertainty on the total back- 
ground expectation. Figure reproduced from Ref. [? ]. 

vided by high pi experiments. The interplay between 
both channels allows the SUSY parameter space to be 
optimally constrained. 
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